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Appendix m 
Nonstandard Matrix Multiplication 

Wave packets. Define wave packets over in the usual way. For a pair P = {pi},Q = 
{Qi} of quadrature mirror filters (QMFs) satisfying the orthogonality and decay con- 
ditions stated in [CW], there is a unique solution to the functional equation 

<l>{t) = V2j^Pi<l>{2t^j). 
Put w = WQfl^o = ^, and define recursively 

tt;2fH-i,o.o(<) = ^/2^gjWn,oM^t - jh 

i€Z 

Then set w^mkit) = 2"'/2^nOo(2"'* - Write W(R) = {wnmk : n,m, k e Z} for the 
collection of functions so defined, which we shall call wave packets. 

The quadrature mirror filters Q may be chosen so that >V(R) is dense in many 
common function spac^. With the minimal hypotheses of [CW], W(R) will be dense 
in L^{Ii). Using the Haar filters P = {1/^2,1/ ^/2},Q = {l/x/2, -l/\/2} produces 
W(R) which is dense in I^(R) for 1 < p < oo. Longer filters can generate smoother 
wave packets, so we can also produce dense subsets of Sobolev spaces, etc. 

Basis subsets. Define a basis subset a of the set of indices {(n,m,fc) € Z^} to be 
any subcoUection with the property that {wnmk ' (ti, m, /:) 6 (t} is a Bilbert basis for 
jD^(R). We characterize basis subsets in [Wl]. Abusing notation, we shall also, refer to 
the collection of wave packets {wnmk • (n,7n,A:) € a} as a basis subset. 

Since n is dense in for 1 < p < oo, with certain QMFs a basis subset will 
also be a basis for L^, Likewise, for nice enough QMFs, it will be a Hilbert basis for 
the various Sobolev spaces. 

Since i^(R) ® L^(R) is dense in L^{B?). the collection of vectors {wx^wy : wx € 
W{X),WY e is dense in the space of Hilbert-Schniidt operators. Call a C a 
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basis subset if the collection {wnxmxkx ^'^nymYky : («Xi"^;c>^A'j"yi"iy, fey) e a} 
forms a Hilbert basis. Such two-dimensional basis subsets are characterized in [W2]. 

Ordering wave packets. Wave packets Wnmk can be totally ordered. We say that 
w <w' if (m, n, k) < {m\ n', A;'). The triplets are compared lexicographically, counting 
the scale parameter m as most significant. 

Tensor products of wave packets inherit this total order. Write wx = '^nsmxkx^ 
etc. Then we will say that wx ® tt/y < v^x ® ^y < ^x^ ^® ^ = y^x 
wy < xvy. This is equivalent to 

{mx.nxjkx.mY.nY.ky) < (mx,n';^,&X'"^y»^y»^y)i 

comparing lexicographically from left to right. 

Define the adjoint order <* by exchanging X and Y indices, namely wx ® w'y <* 
w'^ ® Wy if and only if wy ® wx <* y^y ® "^x- is also a total order. 

Projections. Let denote the space of bounded sequences indexed by the three^ 
wave packet indices n, m, k. With the ordering above, we obtain a natural isomorphism 
between 1°^ and There is also a natural injection : L^CR) <^ given by 
Cr^jnk = {v.Wnmk), for v € i^(R) and u;„mfc € >V(R). If a is a basis subset, then 
the composition of with projection onto the subsequences indexed by a is also 
injective. is an isomorphism of L^(R) onto Z^(o-), which is defined to be the square 
summable sequences of whose indices belong to <7. 
We also have a map 72^ : — > ^^(R) defined by 

(n,m.fc)€Z» 

This map is defined and bounded on the closed subspace of isomorphic to tmder 
the natural isomorphism mentioned above. In particular. B} is defined and bounded 
on the range of for every basis subset a. The related restriction R\ : 
L'^{B) defined by Rlc{t) - E(n,m,A:)6t7 <^mkWnmk{t) is a left inverse for and J^. In 
addition, J^JiJ is a projection of Likewise, if ^^-Oi = 1 and Rl, is one of the 
above maps for each i, then Yli ^i^li ^ ^ projection of It is an orthogonal 
projection on any finite subset of W^- 

Similarly, writing for x W^, the ordering of tensor products gives a natu- 
ral isomorphism between l"^ and W^. The space L^(R^). i.e.. the Hilbert-Schmidt 
operators, inject into this sequence space in the obvious way, namely M i-* 
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{M,Wj,^rnxkx ® tWnymrJby)- Call tlus iiijection J^. If a is a basis subset of W^, 
then the coinposition of with projection onto subsequences indexed by a is also 
injective. is an isomorphism of L^(R^) onto /^(a), the square summable sequences 
of yv^ whose indices belong to a. 

The map iW^ ^ L^iB?) given by R^c{x,y) = J^cxYWx{x)wY{yl is bounded 
on that subset of naturally isomorphic to In particular, it is bounded on the 
range of for every basis subset a. 

We may also define the restrictions JZ^ of to subsequences indexed by cr. defined 
by Rlc{x,y) = 52(ti,x,ti;i')€<T <^XY'^x{x)wY{y). There is one for each basis subset a of 
W^. Then iJ^ is a left inverse of and J^. and J^Rl is a projection of W^. As before. 
^ St ^ = 1 said a* is a basis subset for each i, then J2i ^ ^Iso a projection of 
W^, It is an orthogonal projection on any finite subset of W^. 

Applying operators to vectors. For definiteness, let X and Y be two named copies 
of R. Let V e L^{X) be a vector, whose coordinates with respect to wave packets form 
the sequence J^v — {{v, wx) : v^x € W{X)}. 

Let M : L^{X) L'^(Y) he a Hilbert-Schmidt operator. Its matrix coefficients 
with respect to the complete set of tensor products of wave packets form the sequence 
J^M = {{M,wx ®v)y)iwx& W{X),wy € W[Y)}. We obtain the identity 

{Mv, wy)- ^ {M,wx®wy) (v, Wx) 
This identity generalizes to a linear action of on defined by 

Now. images of operators form a proper submanifold of W^, Likewise, images of vectors 
form a submanifold W^. We can lift the action of M on v to these larger spaces via 
. the commutative diagram 

L2(R) . L2(R) 

M 

The significance of this lift is that by a suitable choice of a we can reduce the complexity 
of the map J^M, and therefore the complexity of the operator application. 
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Composing operators. Let X,Y,Z be three named copies of R. Suppose that 
M : PiX) L^{Y) and N : P{Y) L'^{Z) are Habert-Schmidt operators. We 
have the identity 

This generalizes to an action of on W^, which is defined by the formula 

where c and d are sequences in W^- Using J^, we can lift multiplication by iV to an 
action on these larger spaces via the commutative diagram 

i2(R2) j^2(R2) 

Again, by a suitable choice of a the complexity of the operation may be reduced to 
below that of ordinary operator composition. 

Operation counts: transforming a vector. Suppose that M is a non*sparse opera- 
tor of rank r. Ordinary multiplication of a vector by M takes at least 0{t^) operations, 
with the miTiimnm achievable only by representing Af as a matrix with respect to the 
bases of its r-dimensional domain and range. 

On the other hand, the injection will require C)(r^ [log r]^) operations- and each 
of and i?^ require O(rlogr) operations. For a fixed basis subset a of W^, the 
application of J^M to J^v requires at most #| J^Af] operations, where #|Z7| denotes 
the number of nonzero coefllcients in U. We may choose our wavelet library so that 
#|J^Af| = 0{r^). Thus the multiplication method described above costs an initial 
investment of 0(r^[logr]^), plus at most an additional 0{r^) per right-hand side; Thus 
the method has asymptotic complexity O(r^) per vector in its exact form, as expected 
for what is essentially multiplication by a conjugated matrix. 

We can obtain lower complexity if we take into account the finite accuracy of our 
calculation. Given a fixed matrix of coefficients C. write Cs for the same matrix with 
all coefficients set to 0 whose absolute values are less than S, By the continuity of the 
Hilbert-Schmidt norm, for every e > 0 there is a 5 > 0 such that \\C-Cs\\hs < Given 
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M and e as well as a library of wave packets, we can choose a basis subset a C 
so as to minimize #|(J^Af)tf|. The choice adgoiithm has complexity 0(r2[Iogr]2), as 
shown in [W2]. For a certain class of operators, there is a Hbraiy of wave packets sucli 
that for every fixed S >Owe have 

(S) mi4M)s\ = Oirlogr), 

with the constant depending, of course, on 6. We \nll characterize this class, give 
examples of members, and find useful sufficient conditions for membership in it. For the 
moment, call this dass with property S the spaisiSahle ffilbert-Schmidt operators S. 
By the estimate above, finite-predsion multiplication by sparsifiable rank-r operators 
has asymptotic complexity O(rlogr). 

Operation counts: composing two operators. Suppose that MaadN aie rauk-r 
operators. Standard multiplication of iV and M has complexity 0{r^). The complexity 
of injecting JV and M into is 0(r2[logr]2). The action of J^N on J^M has com- 
plexity 0(i:„„jfc#|4iVy^ : inY,mY,ky) = {n,m, J^M^y : (ny.my,iLv) = 
(n.m,fc)|). The second factor is a constant rlogr, while the first when summed over 
aU nmk is exactly #|J2-W|. Thus the complexiiy of the nonstandard mtUtipKcation 
algorithm, induding the conjugation into the baas set a, is rlogr). Smce 

the first factor is r^ in general, the complexity of the exact algorithm is 0{r^ logr) for 
generic matrices, reflectmg the extra cost of conjugating into the basis set a. 

For the approximate algorithm, the complexity is 0(#|(J|JV)5| rlogr). For the 
sparsifiable matrices, this can be reduced by a suitable choice of a to a complexity 
of 0(r'[logr]2) for the complete algorithm. Since choosing a and evaluating eadi 
have this complexity, it is not possible to do any better by this method. 
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Appendix IV 
Entropy of a Vector Relative to a Decomposition 

Let V e E \\v\\ = 1 and assume 

an orthogonal direct sum. We define 

eHv.{Si}) = -J2\\vi\\Hu\\vif 

as a measure of distance between v and the orthogonal decomposition. 

is characterized by the Shannon equation which is a version of Pythagoras' the- 
orem. 
Let 

-ff* and Hj give orthogonal decomposition S+ —J^IP = ]^ JTy. Then 
eHvr{h\hj}) = ^K{5+,H_} + K||V +||,_„V 

This is Shannon's equation for entropy (if we interpret as in quantum mechanics 
ll-Pff+^^ll^ as the ''probability" of v to be in the subspace J?+). 

This equation enables us to search for a smallest entropy space decomposition of a 
given vector. We need the following H — Hi B 

Hi has two decompositions in IT* or K^. 
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Lemma 1. Let v £H with \\v\\ and 



ll^ill 

Assume also tb&t e^(v{,{H^}) < e^(v{, {K*}) then, ifEi = © 2^' we Jiave 

^{v,{W,U})<e'{y,{KiM}) 

Proof. By Shannon's equation 

e\v, {H", L^}) = e'{v, {^1,^2}) + ||t;i|| V^, {^}) 
+ 11^211 V(t;^,{i^'}) 

< 6^{v, {H^,H2}) + IhllVK^iiTi}) + ||t;2l|VK,{L^}) 

Corollary. Assume e^{v[^{H^}) is the smallest entropy obtained for some collection 
of decompositins of Hi and mnilarly, e^{v2, {I^}) is nmiimal. Then 6^{v; {H*,L^}) is 
minimal for the direct sum of these collections. 

We consider the following generic example on L^(R+). 

Let / denote a dyadic interval of the form (2^*/:,2^*(fc + 1)] , 7 > 0 A; > 0, and {1^} a 
disjoint cover over (0, 00) consisting of dyadic intervals. We let Hj^ = L^{Ia) on which 
we chose an orthonormal basis {e^^^} a fixed (say trig polynomials e}:p{2m^)xu (^)O 
and consider {e^'^t} ^ ^n orthonormal basis of L^(R+). Thus 

a ib 

Given a yector v we wish to find la such that e^{v, {ca^}) is minimal. In order 
to find Ice we use a stopping time argument. Starting with intervals of length one 
li = + 1]. We pick a dyadic interval of length two which contains halves /1J2 of 
length one, i.e. J = /i U I2. We compart 

^ivXjA4}) with e^{vxM4'M'}) 

and pick the basis given the smallest entropy leading to a cover of L^(R) by intervals 
of length one and two. We now consider dyadic intervals K of length 4 and compaae 

e^{vXK{ef}) with e^{vxK;{e^^}) 
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where la form a cover of K by dyadic intervals of length one or two selected previously 
to minimize on each half of K. 

(If the vector function v has bounded support we restrict our attention only to 
dyadic intervals contained in the smallest dyadic interval containing the support of v) 
and continue this procedure up to this largest scale. We claim that the final partition 
Ja and corresponding basis provides .the minimal entropy decomposition. In fact, this 
is an immediate consequence of Lemma 1 which shows that given the optimal minimum 
entropy partition any refinement corresponds to worse entropy. 
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Appendix V 
Higher-Dimensional Best Basis Selection 

Introduction. We introduce a method of coding by orthogonal functions which may 
be used to compress digitized pictures or sequences of pictures, matrices and Uneai- 
operators, and general sampled functions of several variables. The method selects a 
most efficient orthogonal representation from among a large number of possibilities. 
The efficienqr functional need only be additive across direct sum decompositions. We 
present a description of the method for pictures, namely functions of two variables, 
using Shannon entropy as the efficiency functional, and mean-squaie deviation as the 
error criterion. 

Best basis method. In Appendix H is developed a method for generating a Ubraiy 
of orthogonal vectors in R" (for large n) together with a notion of admissible subsets 
of these vectors. Admissible subsets form orthonormal bases of wavelet-packets, whidi 
because of their homogeneous tree structure may be rapidly searched for functional 
extrema. We can use a family of orthonoraial vectors well suited to representing 
functions of 2 variables. These are products of quadrature mirror filters, as defined 
below: 

Let {pfc belong to and define two decimating convolution operators P : 

/2-*/2 Q:i2-./2 as follow. 

■P/*= Pi/i+2&, Qfk= 53 ?i/i+2fc- 

3——0O jss—oo 

■ P and Q are called quadrature mirror Slters if they satisfy an orthogonality ponditiou: 

PQ* = QP' = 0. 

where P* denotes the adjoint of P, and Q* the adjoint of Q. They are further called 
perfect reconstruction filters if they satisfy the condition 

P'P + Q*Q = I, 
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where I is the identity operator. These conditions translate to restrictions on the 
sequences {pjb}, Let mo, mi be (bounded) functions defined by 

Then P, Q are quadrature mirror filters if and only if the matrix below is unitary for 
all^: 

(moiO moi^ + ^)\ 

This fact is proved in [D]. 

Now we can define a number of orthogonal 2-dimensionaI convolution-decimation 
filters in terms of P and Q. Four of them are simply tensor products of the pair of 
quadrature mirror filters, as in the construction of 2-dimensional wavelets of Meyer 
[M]. 



1^ P ® P, FoV{x, y) = '^v{i,j )p2x+iP2y+j 

Pi W P ® Q, Fiv{x, y) = E «(».i)P2*+i92y+i 

id 

Pa ^ C? ® Q, Fzvix, y) = E «(*'i)P2x+tP2y + j 



There are also pairs of extensions of one dimensional filters: 

»J 3 

Qy 1^ / ® Qyvix, y) = E 3)Sx,iq2y+j = E 3)Q2y+j 

i.3 3 

PX^P^L PXV{X, y ) = 5Z 3 )P2x+iSy,3 = E y)P2x+i 

id * 

Qx = Q®I, Qxvix, y) = E i)92x+,5vj = E 2')92x+i 
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These convolution-dedmations have the following adjointi 



s: 



«i7 



ij j 



The ortliogpnaJity relations for this collectioii are as follows: 

-T = F^Fo ® i^f 1 ® f;F2 © Fa'f 3 
■PxrPy = PyP^ = QxQ*x = QrQy = / 
FxQ*x = QxPx = PyQY = QYP^ = 0 
I = PxPx ® QxQx = PyPy e Q*yQy 

By a 'picture" we will mean any function 5 = S{x,y) e fi{Z^). The space 
1^{Z^) of pictures may be decomposed into a partially ordered set W of subspaces 
W{nx,nY,mx.mY), where mx > 0, my > 0, 0 < rix < 2""-^'. and 0 < ny < 2"*i". 
These are the images of orthogonal projections composed of products of convolution- 
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decimations. Put W(0, 0, 0, 0) = and define recursively 

W{2nx + i, 2ny + j, mx + 1, my + 1) = i^2* +j^2i+jT^(nA:, ny , mx, my), 
lJr(2nx,ny,m;c + l^my) = PxPxW{nx,nY,mx,mY), 
W{2nx + hnY,mx +hmY) = Qj^C3;cW^(7ix,ny,mA:,my), 
W^(nx,2ny,mx,my + 1) = PyiVW^(njc,ny,mA:,my), 

W{nx,2nY + l,mx,mY + l) = QxQxW{nx,nYjmx,mYy 

These subspaces may be partiaJIy ordered by a relation which we define recursively as 
well. We say W is a precursor of W (write W ^ W) if they are equal or if PF' = G* GW 
for a convolution-dedmation G in the set {FQyFi,F2^F3,Px,PY,Qx,QY}- We also 
say that 1^ :^ W if there is a fijiite sequence Vi, . . . , of subspaces in W such that 
W :<Vi :< ' :<Vn :< W. This is well defined, since each application of G*G increases 
at least one of the indices nix or my. 

While { W, -<} is not a tree, it may be made into a tree if we select a subset of 
the relation ;:<. We will say that W = W(nx,ny,mj^,my) is a prmcipal precursor of 
W = ,n^,7njif,my) (and write W -< W) if one of the following holds: 

(1) TTiY^mx. and W' = G*GW for G e {Fo.FuFi^Fs.Px.PyyQx.Qv}. or 

(2) my < mxn and W = G^GW for G € {Px, Qx}^ or 

(3) my > mx. aad TF' = G*GW for C? e {JVi Qy}. 

Further, we will say that -< W if there is a finite sequence Vb, . . . , Vii of subspaces 
in W with P7 X Vb ^ ^ K W\ The relation ^ is well defined, since it is a 
subrelation of :;<, and it is not hard to see that every subspace W eW has a unique 
first principal precursor. Therefore, {W, forms a (nonhomogeneous) tree, with 
T7(0, 0,0,0) at its root. 

Subspaces of a single principal precursor W eW will be called its children. By the 
orthogonality condition, 

(F) W = F^FoW e F^FiW 0 F;F2W ® F^F^W 

(X) ^P^PxWeQ^QxW 
(Y) ^P^PyWqQ^QyW. 

The right hand side contains all the children of W. divided into the groups "F," "X." 
and 'T." Each labelled group of children provides a one-step orthogonal decomposition 
of W. and in general we will have three subsets of the children to choose from. 
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The coordinates with respect to the standard basis of W{nx, ny, mx, my) form the 
sequence Gi . . . G^W{0, 0, 0, 0), where m = maxfrnjc, my}, and the particular filters 
are determined uniquely by nx and ny. This is described in Appendix 
n, attaxied hereto. Therefore we can express in standard coordinates the orthogo- 
nal projections of Pr(0, 0,0,0) onto the complete tree of subspaces W by recursively 
convolving and decimating with the filters. 

Relation with one-dimensional wave packets. Let w^mk be a one-dimensional 
wave packet at sequency n, scale m and position fc, in the notation of Appensix II. 
Then the element in the (fcjf.fcy) position of the subspace W{nx,nY,mx.mY), at 
the index (a:,y), may be written as w^^,mx,kAx)tv,,y,my,ky{yh which is evidently 
the tensor product of two one-dimensional wave packets. This is easily seen from the 
construction of W{nx,nY,mx,TnY): in the x-direction, there will be a total of vix 
convolution-dedmations in the order determined by nx, with the result translated to 
position kx, and similarly in the y-direction. 

We will use the notation iu ® v for the tensor product of two one-dunensional wave 
packets, with the understanding that the second factor depends on the y-coordinate. 
Since the one-dimensional wave packets are themselves a redundant spanning set, theii- 
tensor products contain a redundancy of bases for l^R\ We can search this coUection 
of bases efficiently for a best-adapted basis, using any additive measure of information, 
in a manner only sUghtly more compHcated than for the one dimensional case. 

Selecting a best basis. Let S = S{x, y) be a picture, and let W be a tree of wavelet 
packets. Choose an additive measure of information as described in Appendix H. and 
attribute to each node W{nx,nY,mx,mY) the measure of information contained iu 
the coordinates of 5 with respect to the wavelet packets it contains. For example, we 
may use Shannon entropy, 

where p = (5, Wn^mykx ® w^y^ykyh andW = W{nx, mx^ny, my). We will choose 
an arbitrary maximum level in the tree W. and mark all of its nodes as "kept." Pro- 
ceeding up from this level to the root, we will compare HiW) for a node W of the tree 
W to the minimum of Zw^weF nw% Ew^w^x '^^iW). and Zw^w^y n{W'). 
If U{W) is less, then mark W as "kept" and mark as "not kept" aU nodes W with 
W ■< W; otherwise, maikW as "not kept," but attribute to it the minimum of the en- 
tropies of its children. When this procedure terminates at the root, the nodes marked 
"kept" will comprise an orthogonal collection of wavelet packets. 
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It is not necessary to mark all descendants of a "kept" parent as not kept. The 
complexity of the search algorithm is 0(n log n) if we never change the status of de- 
scendants, but instead take for the orthogonal collection only those nodes marked 
"kept" which have no ancestors marked "kept." These may be listed efficiently by 
indexing the tree in the preorder or depth-first order. 

Error estimates for the best basis. Let H{S) denote the entropy of the picture 
S in the best basis found above. This quantity will be found attributed to node 
W(0, 0, 0, 0) at the end of the search. It is related to the classical Shannon entropy Hq 
by the equation 

no{S) = \\sr'n{s) + iog\\sf 

The largest exp7{Q{S) = ||5|p expW(5)/||5|p terms of the wavelet packet expansion 
for S contain essentially all the energy of the original picture. Mean square enor 
bounds for specific classes of signals are provided in Appendix IV. 
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CLAIMS ; 

1. A method for encoding and decoding an input signal, 
comprising the steps of: 

applying combinations of dilations and translations of 
a wavelet to the input signal to obtain processed values; 

computing the information costs of the processed 

values ; 

selecting, as encoded signals, an orthogonal group of 
processed values, the selection being dependent on the computed 
information costs; and 

decoding the encoded signals to obtain an output 

signal. 

2. The method as defined by claim 1, wherein said wavelet 
has a plurality of vanishing moments. 

3. The method as defined by claim 1 or 2, further 
comprising transmitting the encoded signals, and receiving the 
transmitted encoded signals before the decoding thereof. 

4. The method as defined by claim 3, further comprising 
storing the encoded signals, and reading the stored encoded 
signals before the decoding thereof. 

5. The method as defined by claim 2, wherein said step of 
applying combinations of dilations and translations of the 
wavelet to the input signal to obtain processed values comprises 
correlating said combinations of dilations and translations of 
the wavelet with the input signal. 

6. The method as defined by claim 2 or 5, wherein 
combinations of dilations and translations of the wavelet are 
designated as wavelet-packets, and wherein the step of applying 
wavelet-packets to the input signal to obtain processed values 
includes; generating a tree of processed values, the tree having 
successive levels obtained by applying to the input signal, for a' 
given level, wavelet-packets which are combinations of the 
wavelet-packets applied at a previous level. 

7. The method as defined by claim 6, wherein the steps of 
computing information costs and selecting an orthogonal group of 
processed values includes performing said computing at a number 
of different levels of said tree, and performing said selecting 
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from among the different levels of the tree. 

8. The method as defined by claim 2 or 7, wherein said 
step of selecting an orthogonal group of processed values 
comprises selecting an orthogonal group having a minimal 
information cost. 

9. The method as defined by claim 8, wherein said step of 
selecting an orthogonal group of processed values includes 
generating encoded signals which represent said processed values 
in conjunction with their respective locations in said tree. 

10. A method for encoding an input signal, comprising the 
steps of: 

selecting a wavelet having a plurality of vanishing 

moments ; 

applying combinations of dilations and translations of 
the wavelet to the input signal to obtain processed values; and 

selecting/ as encoded signals, an orthogonal group of 
processed values. 

11. The method as defined by claim 10, further comprising 
decoding the encoded signals. 

12. The method as defined by claim 10 or 11, further 
comprising transmitting the encoded signals, and receiving the 
transmitted encoded signals before the decoding thereof. 

13. The method as defined by claim 10, wherein said step 
of applying combinations of dilations and translations of the 
wavelet to the input signal to obtain processed values comprises 
correlating said combinations of dilations and translations of 
the wavelet with the input signal. 

14. A method for encoding an input signal, comprising the 
steps of: 

applying combinations of dilations and translations of 
*a wavelet to the input signal to obtain processed values r . 

computing the information costs of the processed 
values; and 

selecting, as encoded signals, an orthogonal group of 
processed values, the selection being dependent on the computed 
information costs. 

15. The method as defined by claim 14, wherein said 
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wavelet has a plurality of vanishing moments. 

16. The method as defined by claim 15, further comprising 
transmitting the encoded signals, and receiving the transmitted 
encoded signals before the decoding thereof. 

17. The method as defined by claim 15, further comprising 
storing the encoded signals. 

18. The method as defined by claim 15, wherein said step 
of applying combinations of dilations and translations of the 
wavelet to the input signal to obtain processed values comprises 
correlating said combinations of dilations and translations of 
the wavelet with the input signal. 

19. The method as defined by claim 15 or 18, wherein 
combinations of dilations of the wavelet are designated as 
wavelet-packets, and wherein the step of applying wavelet-packets 
to the input signal to obtain processed values includes: 
generating a tree of processed values, the tree having successive 
levels obtained by applying to the input signal, for a given 
level, wavelet-packets which are combinations of the 
wavelet-packets applied at a previous level. 

20. The method as defined by claim 19, wherein the steps 
of computing information costs and selecting an orthogonal group 
of processed values includes performing said computing at a 
number of different levels of said tree, and performing said 
selecting from among the different levels of the tree. 

21. The method as defined by claim 15, wherein said step 
of selecting an orthogonal group of processed values comprises 
selecting an orthogonal group having a minimal information cost. 

22. The method as defined by claim 20, wherein said step 
of selecting an orthogonal group of processed values comprises 
selecting an orthogonal group having a minimal information cost. 

23. The method as defined by claim 22, wherein said step 
of selecting an orthogonal group of processed values includes 
generating encoded signals which represent said processed values 
in conjunction with their respective locations in said tree. 

24. For use in a system which receives an encoded signal 
obtained by: applying combinations of dilations and translations 
of a wavelet having a plurality of vanishing moments to the input 
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signal to obtain processed values; and selecting, as encoded 
signals, an orthogonal group of processed values; a decoding 
method comprising: sequentially applying combinations of 
dilations and translations of a wavelet having a plurality of 
vanishing moments to the encoded signals to decode said encoded 
signals; and outputting the decoded result. 

25. The decoding method as defined by claim 24, wherein the 
encoded signals include identification of the selected orthogonal 
group of processed values, and further comprising the step of 
determining said identification, and performing the sequential 
decoding procedure in accordance with said identification. 

26. Apparatus for encoding an input signal, comprising: 
means for applying combinations of dilations and 

translations of the wavelet to the input signal to obtain 
processed values; 

means for computing the information costs of the 
processed values; 

means for selecting, as encoded signals, an orthogonal 
group of processed values, the selection being dependent on the 
computed information costs. 

27. Apparatus as defined by claim 26, wherein said 
wavelet has a plurality of vanishing moments. 

28. Apparatus as defined by claim 27, wherein said means 
for applying combinations of dilations and translations of the 
wavelet to the input signal to obtain processed values comprises 
means for correlating said combinations of dilations and 
translations of the wavelet with the input signal. 

29. Apparatus for encoding an input signal, comprising: 
means for applying combinations of dilations and 

translations of a wavelet having a plurality of vanishing moments 
to the input signal to obtain processed values; and 

means for selecting, as encoded signals, an orthogonal 
group of processed values. 

30. For use in a system which receives an encoded signal 
obtained by: applying combinations of dilations and translations 
of a wavelet having a plurality of vanishing moments to the input 
signal to obtain processed values; and selecting, as encoded 
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signals/ an orthogonal group of processed values; a decoding 
apparatus comprising: means for sequentially applying 
combinations of dilations and translations of a wavelet having a 
plurality of vanishing moments to the encoded signals to decode 
said encoded signals; and means for outputting the decoded 
result • 
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COMPRESSION AND DECOMPRESSION OF DOCUMENTS AND THE LIKE 
TECHNICAL FIET.n o p THE TWyTripTj OM 

This invention is in the field of data compression, 
and more specifically is in the field of data compression 
of documents and similar images that may be represented 
as digital data. 

BACKGROUND OP T^B INVRI^ TOM 

The increase in computational speed achieved by 
modem computer technology has enabled the widespread use 
of electronic techniques for the communication and 
storage of documents of many types. For purposes of this 
application, the term "documents" will refer to two- 
dimensional representations or renderings of a nature 
that are conventionally made on a paper medium or on a 
graphics display by writing, printing, typing, drawing, 
operation of a computer-aided draw, paint or design 
program, or other similar conventional techniques, and 
those renderings that are commonly stored on paper, 
microfilm, microfiche, or electronically. Examples of. 
such documents include, without limitation thereto, text 
documents, bank checks and other banking transaction 
records, vital records, maps, charts, printed works 
(including combinations of text and graphics) , seismic 
plots, medical records, bank and insurance records, 
directories, and the like. As is well known to the 
public, examples of conventional electronic communication 
and storage of documents includes modern digital 
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facsimile ("fax") equipment, CD-ROM storage and 
distribution of encyclopedias and other series of books, 
electronically or magnetically stored representations of 
transactions such as checks and other banking 
5 transactions and statements, and the like. 

Electronic communication and storage is made quite 
difficult for those documents in which a significant 
portion of the information contained in many documents is 

10 in the form of graphic information which cannot readily 
be coded into computer- readable form on a character-by- 
character basis. For example, while the letters in a 
person's signature may readily be electronically 
communicated by way of an ASCII coded representation on a 

15 character -by-character basis, the retrieval of the ASCII 
coded representation will not provide any information 
regarding the appearance of the signature as made by the 
person. To electronically communicate or store documents 
with graphic information, the conventional technique is 

20 to digitize the document into a bit-map representation, 

with each memory location in the bit map corresponding to 
an elemental position of the document, and with the 
contents of the memory locations corresponding to the 
color at that location. In the case of a "binary" 

25 digital document representations, for example, the 

digital value "0" may be used to represent the color 
black and the digital value "1" may be used to represent 
the color white. 

30 Of course, it is highly desirable that the 

digitizing of documents be performed with as high a 
resolution as possible, which requires the use of a 
significant amount of memory for each document page. For 
example, current high resolution digitization devotes on 

35 the order of greater than one million bits to store a 

binary bit map representation of a single monochromatic 
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(black and white) document page divided into 1024-by-l024 
picture elements, or "pixels". On a 300 dpi laser 
printer, this Ik-by-lk picture will be printed on a paper 
area of approximately three inches by three inches . 
Therefore, even finer resolution may be required for 
larger high quality pictures to be printed, and also for 
certain types of documents, such as fingerprint files 
utilized by law enforcement agencies. 

These high resolution digitally stored 
representations of documents obviously not only occupy a 
great deal of computer memory and hard disk storage 
space, but also require significant time for their 
transmission over conventional communication lines. As 
such, conventional data compression techniques are highly 
useful in the electronic communication and storage of 
documents. Conventional data compression techniques are 
generally referred to as of either "lossless" or "lossy", 
depending upon whether data is discarded in the 
compression process. For most digitized documents with 
graphics information, lossy data compression techniques 
may be used, so long as the receiver is still able to 
distinguish the graphic image with acceptable clarity. 

A survey of conventional lossy data compression 
techniques may be found at Simon, "How Lossy Data 
Compression Shrinks Image Files", PC Maoazir^i^ (July 
1993), pp. 371 et seq. A popular one of these 
conventional lossy data compression techniques is 
referred to as the JPEG (Joint Photographic Experts 
Group) method. A description of this technique may be 
found in Bamsley and Kurd, Fractal imaoe Compyp^ci/^p (ak 
Peters. Ltd.. 1993), pp. 219-228. The JPEG compression 
method initially divides the image into blocks of pixels, 
and a Discrete Cosine Transform (DCT) is performed on 
each pixel block, producing a representation of the block 
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as coefficients corresponding to frequencies and 
amplitudes, rather than corresponding directly to color 
information. These coefficients are then quantized, or 
rounded off, and a difference algorithm is performed over 
5 all quantized blocks in the image, in a selected scan 
order. This difference algorithm subtracts a DC terra 
corresponding to the mean pixel value of a block, from 
the DC term of the preceding block. The difference 
coefficients are then scanned in a different order, such 

10 as a zig-zag order, and the non-zero coefficients (i.e., 
blocks in which a difference from the preceding block 
occurred) are coded to indicate the number of preceding 
zero coefficients (i.e., the number of pixel blocks in 
which no change occurred) and also the value of the non- 

15 zero difference. Lossless compression is then often 

applied to the coded result to further compress the data. 
Decompression is performed by reversing the compression 
process, producing the displayable image. 

20 Another conventional method of lossy video image 

compression, referred to as Recursive Vector Quantization 
(RVQ) , quantizes the pixel blocks directly, without a DCT 
or other transform, according to a set of selected 
reference tiles. See Simon, July 1993, op, cit . . The 

25 reference tiles are selected according to an. iterative 
technique, based upon the accuracy of the results 
relative to the original image. As noted in the Simon 
article, compression according to the RVQ method is 
computationally intense and complex, but decompression 

30 can be done quite rapidly. 

Another type of conventional lossy video image 
compression techniques is referred to as fractal 
compression. As is well known in the art, a fractal is a 
35 mathematical image object that is self -similar , in that 
the image can be represented in terms of other pieces of 
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the image. In fractal image compression, the input image 
is similarly divided into pixel groups, or tiles. Each 
tile is then approximated by a transformation 
(contractive, rotational, or both) of one or more other 
reference regions of the image. The compressed image 
thus consists of a full representation of the reference 
region, plus the transformation operators for each of the 
tiles. Each tile of the image is decompressed by 
performing a transformation of the reference region using 
the stored transformation operator for that tile. 
Detailed descriptions of conventional fractal image 
compression techniques and systems for performing the 
same may be found in Barnsley & Kurd, Fractal Tmaa^ 
C<?mprgSPiop (AK Peters, Ltd., 1993), in U.S. Patent No. 
4,941,193, and in U.S. Patent No. 5,065,447. 

Wavelet -based techniques are also known to be used 
for the data compression of digitally stored documents. 
An summary of this technique, as applied to the 
compression of digitized fingerprint files used in law 
enforcement, is described in Bradley and Brislawn, "The 
Wavelet/Scalar Quantization Compression Standard for 
Digital Fingerprint Images", Tec. Rep. LA-OR-94-827, 
Pyoc. IEEF; TSrft,S-94 (IEEE, 1994) . This approach utilizes 
the well-known Daubechies wavelet, applied to the 
document data by way of a table look-up technique. The 
Daubechies wavelet function used in this case is not 
bounded, however, and as such boundary effects may be ^ 
present in the reconstruction; indeed, the approach 
described in the Bradley and Brislawn paper attempts to 
reduce boundary effects by reflecting the wavelet filters 
at the image boundaries. Furthermore, this prior 
approach is generalized to 8-bit images (i.e., for 
compression of full grey scale images) . Use of this 
approach for documents, particularly for binary 
documents, will thus be especially cumbersome considering 
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the size of the array to be considered in the image 
compression process, and the eventual compression ratio 
will be somewhat limited. 

5 By way of further background, the technique of byte 

packing as a lossless compression technique is well known 
in the field of computer data storage. According to this 
well-known technique, digital values that may be 
expressed by fewer bits are packed with other similar 

10 values into a single byte. For example, if eight bit 
words are used to express a stream of data having only 
two values, e.g., 2552 and Oj, byte packing will allow the 
data stream to be expressed with a single bit per value. 
Each byte (eight bits) will thus be able to express the 

15 same information as eight bytes of raw data. 

By way of further background, our copending 
application S.N. 08/181,663, filed January 14, 1994, 
entitled "Method and Apparatus for Video Image 

20 Compression and Decompression Using Boundary -Spline- 
Wavelets", assigned to Houston Advanced Research Center 
and incorporated herein by this reference, describes a 
wavelet analysis -based approach to video compression, 
particularly in the field of motion pictures. This 

25 application S.N. 08/181,663 also provides a discussion of 
the background theory of wavelet analysis, and its 
application to image compression. 

It is an object of the present invention to provide 
3 0 a method and apparatus for compressing digitized 

representation of documents to a high degree, in such a 
manner that the resolution and fidelity of the received 
document upon decompression is maintained. 

35 It is a further object of the present invention to 

provide such a method and apparatus in which the 



wo 96/09718 



PCTAJS95/12030 



7 



decompressed document may be magnified to an arbitrary 
extent . 



It is a further object of the present invention to 
provide such a method and apparatus in which boundary 
effects are effectively eliminated in the compressed 
document, thus improving the quality of the compressed 
document and also enabling zoom-in and zoom-out 
techniques to be applied to the reconstructed document. 

It is a further object of the present invention to 
provide such a method and apparatus in which the 
compression may be performed by integer operations, 
increasing the speed of the compression and enabling 
relatively low cost computing equipment (or hardware) to 
perform the compression quickly. 

It is a further object of the present invention to 
provide such a method and apparatus that is suitable for 
storage and communication of compressed digitized 
documents . 

It is another object of the present invention to 
provide such a method and apparatus in which the 
compression technique is selectable. 

Other objects and advantages of the present 
invention will be apparent to those of ordinary skill ^in 
the art having reference to the following specification 
together with its drawings. 



